Web Sessions Clustering with Artificial Ants Colonies
Abstract
In this paper, we present AntClust, an ant-based clustering algorithm, and its application to the Web usage mining problem. We define a Web session as a weighted multi-modal vector and develop a similarity measure between two sessions. We show that the partitions found by AntClust are stable on a data set of real sessions extracted from a Web site of the University of Tours. Unlike some other studies, we do not restrict ourselves to the transaction model when describing sessions. We show that our algorithm performs well and is able to find non-noisy clusters when each session is defined by a vector containing the number of hits recorded for each Web page.
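As a rough illustration of the hit-count representation mentioned in the abstract, the sketch below builds one vector per session and compares two sessions. The abstract does not give the paper's actual similarity measure, so cosine similarity is used here purely as a stand-in, and the names hits_vector and cosine_similarity, as well as the example page paths, are hypothetical.

```python
import math


def hits_vector(session_pages, site_pages):
    """Return the number of hits per site page for one session.

    session_pages: pages requested during the session, in order.
    site_pages: fixed, ordered list of all pages of the site.
    Both names are illustrative assumptions, not taken from the paper.
    """
    counts = {page: 0 for page in site_pages}
    for page in session_pages:
        if page in counts:  # ignore requests for pages outside the site list
            counts[page] += 1
    return [counts[page] for page in site_pages]


def cosine_similarity(u, v):
    """Cosine similarity between two hit-count vectors.

    This is a stand-in measure, NOT the similarity defined in the paper.
    Returns 0.0 when either vector is all zeros.
    """
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


if __name__ == "__main__":
    site = ["/", "/courses", "/staff"]  # hypothetical page list
    s1 = hits_vector(["/", "/courses", "/courses"], site)
    s2 = hits_vector(["/", "/staff"], site)
    print(s1, s2, cosine_similarity(s1, s2))
```

Any clustering method that accepts a pairwise similarity could consume these values; the weighted multi-modal session vector described in the abstract would carry more information than hit counts alone.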
Similar resources
AntClust: Ant Clustering and Web Usage Mining
In this paper, we propose a new ant-based clustering algorithm called AntClust. It is inspired by the chemical recognition system of ants. In this system, the continuous interactions between the nestmates generate a “Gestalt” colonial odor. Similarly, our clustering algorithm associates an object of the data set with the odor of an ant and then simulates meetings between ants. At the end, artif...
AntTree: A Web Document Clustering Using Artificial Ants
We present in this work a new algorithm for hierarchical document clustering and automatic generation of portal sites. This model is inspired by the self-assembling behavior observed in real ants, where ants progressively become attached to an existing support and then to other attached ants. The artificial ants that we have defined will similarly build a tree. Each ant represents one do...
Learning Web Users Profiles With Relational Clustering Algorithms
In the context of web personalization and dynamic content recommendation, it is crucial to learn typical user profiles. Although there exist several approaches to mining user profiles (such as association rules or sequential pattern extraction), this paper focuses on the application of relational clustering algorithms to web usage data to characterize user access profiles. These methods rely on...
Predicting web user behavior using learning-based ant colony optimization
An ant colony optimization-based algorithm to predict web usage patterns is presented. Our methodology incorporates multiple data sources, such as web content and structure, as well as web usage. The model is based on a continuous learning strategy based on previous usage in which artificial ants try to fit their sessions with real usage through the modification of a text preference vector. Sub...
Application of Ant-based Template Matching for Web Documents Categorization
The self-organization behavior exhibited by ants may be modeled to solve real world clustering problems. The general idea of artificial ants walking around in search space to pick up, or drop an item based upon some probability measure has been examined to cluster a large number of World Wide Web (WWW) documents. However, this idea is extended with the direct application of template matching wi...
Publication date: 2003